Analysis of Sindhi Spelling Error Patterns for Spelling Error Detection and Correction

نویسندگان

  • Muhammad Umair
  • Mutee U Rahman
چکیده

Statistical analysis of spelling error trends in a language plays important role in automatic spelling error detection and correction. Comprehensive statistical analysis of spelling error trends for Sindhi is still subject of research. This research study identifies and analyses the spelling error trends in Sindhi. The statistical analysis of error trends is based on a real time corpus collected from different sources. A corpus based dictionary is developed and used for identification of errors from a test corpus. Both traditional (insertion, deletion, substitution and transposition) and language specific error trends are identified and analyzed. Errors are categorized into different types along with their statistical results. Reasons of various corpus/language specific error trends are also discussed. Keywords—Spelling error; spelling error patterns; Sindhi spelling errors; error pattern analysis;

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Spelling Error Trends and Patterns in Sindhi

Statistical error Correction technique is the most accurate and widely used approach today, but for a language like Sindhi which is a low resourced language the trained corpora’s are not available, so the statistical techniques are not possible at all. Instead a useful alternative would be to exploit various spelling error trends in Sindhi by using a Rule based approach. For designing such tech...

متن کامل

Design and implementation of Persian spelling detection and correction system based on Semantic

Persian Language has a special feature (grapheme, homophone, and multi-shape clinging characters) in electronic devices. Furthermore, design and implementation of NLP tools for Persian are more challenging than other languages (e.g. English or German). Spelling tools are used widely for editing user texts like emails and text in editors.  Also developing Persian tools will provide Persian progr...

متن کامل

Joint English Spelling Error Correction and POS Tagging for Language Learners Writing

We propose an approach to correcting spelling errors and assigning part-of-speech (POS) tags simultaneously for sentences written by learners of English as a second language (ESL). In ESL writing, there are several types of errors such as preposition, determiner, verb, noun, and spelling errors. Spelling errors often interfere with POS tagging and syntactic parsing, which makes other error dete...

متن کامل

A Survey of Spelling Error Detection and Correction Techniques

Spelling Correction is a process of detecting and sometimes providing suggestions for incorrectly spelled words in a text. Spell Checker is an application program that flags words in a document that may not be spelled correctly. Spell Checker may be stand-alone capable of operating on a block a text such as word processor, electronic dictionary. When some text is given as an input to spell chec...

متن کامل

Spelling Error Patterns in Spanish for Word Processing Applications

This paper reports findings from the elaboration of a typology of spelling errors for Spanish. It also discusses previous generalizations about spelling error patterns found in other studies and offers new insights on them. The typology is based on the analysis of around 76K misspellings found in real-life texts produced by humans. The main goal of the elaboration of the typology was to help in...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2013